As we have seen the changes between the features extracted from the Spotify API are small but present. However classifiers or clustering methods could be used to try and see if these changes are significant enough, such that a distinction can be made. If it is possible to find a method which is able to most of the time correctly predict to which cluster a track belongs too, then a definitive conclusion can be drawn.
By using all the features of the API, it was possible to try and find such a method. K-Nearest Neighbours (or knn), Decision Tree and Random Forest were all used to try and find the best possible classification/clustering. It was fairly obvious that the best method was Random Forest, although using kNN with a value of 7 or 15 for k both came close but still always performed worse. A 10-fold cross-validation was applied to ensure accurate and representative findings. Cross-validation is a model validation technique for assessing how the results of a statistical analysis will generalize to an independent data set. By using the 10-fold data set and the features the following values were found for precision and recall:
| Bizzey | Precision | Recall |
|---|---|---|
| With | 0.7 | 0.471 |
| Without | 0.729 | 0.873 |
A precision recall table for both the time Bizzey was a member and when he wasn’t. It was generated from the results of a Random Forest model using 10-fold cross-validation.
These results were paired with an accuracy of about 70%. This might sound good but, as is clearly visible in the table, the recall for identifying a track as with Bizzey is quite low. A low recall means that there are quite some false negatives, in this case about 5 out of every 10 predictions for the group with. Precision is also not the best for both groups, as this means 3 out of 10 predictions is a false positive. Despite the high recall for the group without, this is not a model you would want to use. Therefore a selection of the API features was made to try and improve performance. This was done through process of elimination by eliminating first the Key features, as these were hardly influential, then the lowest five features and lowest five timbre components and finally the lowest feature was removed from which the following results ensued:
| Bizzey | Precision | Recall |
|---|---|---|
| With | 0.784 | 0.659 |
| Without | 0.808 | 0.887 |
A precision recall table with an optimized selection of the Spotify API features, for both the time Bizzey was a member and when he wasn’t. It was generated from the results of a Random Forest model using 10-fold cross-validation.
Now with an accuracy of 80%, way higher precisions and recalls it can be said a fairly decent model was build which is able to most of the time distinguish between the two groups. We can better see the results on the confusion matrix, which shows the distribution of predictions made by the model against the truth of the data. Next to it is a histogram showing the importance of the selected features used in the Random Forest. There are 5 major contributors which always are loudness, acousticness, tempo, c02 and c07. Out of all these Tempo is always the most important for distinguishing the groups, this is not much to our surprise after our last tab. However acousticness and c07 are surprising since the difference in both was barely noticeable according to the previous tabs.
Yellow Claw has and probably will keep being one of my favorite DJ’s. Interestingly enough they are not the most famous Dutch DJ, but among peers they are known for having very nostalgic and good songs. It started with three members named Bizzey, Jim Aasgier en Nizzle in 2010. Only Bizzey had experience previous to joining the group, as he released 2 songs under his name in 2006. Since I stuck with the group for so long it was only logical to analyze them. I will do this using the following questions:
A few things happened over the years to the group so it will be interesting to find out if they changed due to them. To properly research these questions a self-made chronological playlist of all the songs listed in the Spotify page (except remixes) will be used, this was needed as the ‘This is’ Spotify playlist was not representative.
Personally I expect they have changed, this could be backed up by the fact that their first songs differ in quality from their first album (released in 2015). This is most likely because their first album was released under the Mad Decent, which is Diplo’s (a very famous DJ) recordlabel. However most likely the biggest change occurred in 2016 when Bizzey left the group to spend more time with his family. His main job was the MC, or master of ceremonies, which is used as an alternative indication for rappers or performing artists.
The possibility remains that there might not be a big change at all. Since they have only been active for 11 years of which they spend only 9 producing their own songs and half was spend with and the other without Bizzey. In this time frame they released four albums and a total of 115 tracks. However the amount of tracks and the time as a group are relatively small compared to artists known to have gone through different phases, such as Madonna or Cher. This means it is possible that we might not be able to discover a change due to little data, especially since the 2021 songs are rather different than the others before. Let’s not dwell on the possibilities and have a look at some of the more typical and atypical tracks they have made over the years.
Typical tracks:
These are very typical as they define their past and current style in beat and beatdrop.
Atypical tracks:
These are all tracks with a very different style then the other tracks made by Yellow Claw.
If you are interested you can check out their tracks right here. But for now let’s try to discover if they have changed.
This plot shows the difference in speechiness and liveness in the time Bizzey was and wasn’t part of the group. One can clearly see that after Bizzey left the groep the speechiness contained more variety and got a higher mean. Also the liveness became more apparent. It is strange to see a change in the speechiness since Bizzey was a MC. You would expect the departure of a MC to decrease the speechiness of produced songs, but this is not the case here. One would expect it because rap music is indicated by Spotify to have a speechiness between 0.33 and 0.66. While values above 0.66 are indicated to describe tracks that are probably made entirely of spoken words, and values below 0.33 most likely represent music and other non-speech-like tracks. Therefore the low values are suprizing altogether but an MC, or rapper, leaving leading to an increase in the mean is at least to be called peculiar. One can also note an increase in liveness in songs after Bizzey left, whilst none of the songs are life performances. Eventhough none of the songs pass the 0.8 barrier set by Spotify to qualify as a live recording, but it can indicate a change to the music. This since clearly the algorithm notices something different to indicate the higher liveness.
This plot shows the difference in acousticness and liveness in the time Bizzey was and wasn’t part of the group. One can clearly see that after Bizzey left the groep the acousticness contained more variety and got a higher mean. Also the liveness became more apparent. It is strange to see a change in the acousticness since Bizzey was a MC. You would expect the departure of a MC to decrease the acousticness of produced songs, but this is not the case here.
Here we can see the chromagram and self-similarity matrices of Loudest MF, which scored very high on speechiness.
This is a 30 second sample from the track Loudest MF.
Chromagram
You can see the fade away of sound between 28 and 36 sec as only G# is present which is the muffled tone. After this it is clear that there is a distinct beat since it is visible in A/Ab and E/Eb and slightly in C#/C. This until around 50 sec where the rapper starts and the spectrum contains more yellow and thus there are more frequencies present until around 65 sec. You can see and hear at 85 sec the second beat drop, with a lot of clean notes being played until 115 sec. Also you can see at 105 sec is where the rapper says ‘bounce’ a couple of times as the C# is faded in the C. Then the rapper starts again as visible in the amount of present frequencies. At 155 sec the third beat starts decently clean as most is present in G# but around 160 seconds muffled sound is added to the beat and the beat gets more components from then on. However it is still clear that the music fluctuates between high and low.
Self-similarity matrices
You can clearly see the most points mentioned above in the chroma-based self-similarity matrix. For example the high and low fluctuations of the same beat from 150 sec to the end of the song is depicted very clearly. However from both matrices it is clear there was not alot of repetition found by Spotify.
Here two songs are compared to Loudest MF, namely DJ Turn It Up and Amsterdamned. The first is a typical track that dates back to when Bizzey was still a member of Yellow Claw. Whilst the second is a typical track that was made without Bizzey.
DJ Turn It Up
This is a 30 second sample from the track DJ Turn It Up.
From the timbre-based self-similarity matrices a pattern of the song becomes clear. It is visible that from 20 sec to 95 sec in the song the same occurs as from 95 sec to 160 sec. This coincides more with the typical yellow claw style of some repetition in their song, than was the case with Loudest MF.
Amsterdamned
This is a 30 second sample from the track Amsterdamned.
Both the timbre-based and chroma-based self-similarity matrices visualize a very repetitive track. It is visible that from 45 sec to 95 sec in the song the same occurs as from 125 sec to 175 sec. But also from 20 sec to 45 sec the track is repeated in 95 sec to 125 sec. This also coincides more with the typical yellow claw style of some repetition in their song, than was the case with Loudest MF.
Spotify assigns each track a key that was mainly used according to their feature analysis. In this graph, one can see all the main keys of the tracks before and after Bizzey left the group. There clearly is a difference in the key use after Bizzey leaves. His departure brought way more tracks in F both major and minor, and more tracks in C# major. Also the graph is more shifted towards the A, A# and B this in the minor of those keys. Most tracks made with Bizzey are played in both G major and minor, however they are tied with a lot of other keys. One can also note how the G minor key is used less often after he left the group.
In this section two graphs are computed using lower-level track audio analysis from Spotify. The distinction is again made between the era where Bizzey was a member and the era where he wasn’t.
Timbre coefficients
In this graph one can hardly distinguish between the two time era of Yellow Claw, a few coefficients are slightly different after Bizzey left for example. But these differences so small that no conclusions can be drawn from them. However there is one big difference, the variance in c02 coefficient which is known as the loudness or brightness of a track. Here it is visible that the timbre loudness has a range of nearly 100 when Bizzey was still in the band. However after his departure the range nearly doubles as it gets close to a range of 200.
Pitch classes
Yet again one can hardly distinguish between the two time era of Yellow Claw and only a few coefficients are slightly different after Bizzey left for example. These differences are also to small that no conclusions can be drawn from them. However one could argue there is one difference, namely in the C pitch. It is visible that the range, mean and deviation of this pitch are different after Bizzey left.
Both plots are distributed over the time period with and without Bizzey
Tempo distribution
After Bizzey left Yellow Claw had a significant shift in tempo distribution. This is because as is visible in the graph tracks were made with tempo above the 150 bpm. And not just a few tracks, almost half of their new music would be above this previous apparent threshold.
Loudness distribution
The loundess decreased after bizzey left.
Yes they have changed.